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(5)  Introduction 

The  ability  of  cancer  cells  to  proliferate  inapproriately  is  due  the  loss  of  tumor 
suppressor  genes  and  the  gain  of  function  of  oncogenes.  Two  classes  of  tumor 
suppressor  genes  exist,  mutators  and  growth  regulators.  Mutators  are  genes  that  when 
mutant  cause  an  increase  in  the  rate  of  genomic  instability  and  hence  the  rapid  accumulation 
of  mutations  of  all  classes.  Genes  of  the  mutator  class  include  mis-match  repair  genes 
involved  in  HPCC,  DNA  repair  genes  such  as  X.P.,  cell  cycle  checkpoint  genes  such  as 
ATM  and  p53.  Recently  two  breast  cancer  genes  BRCA1  and  BRCA2  have  been 
implicated  in  DNA  repair  and  fall  into  this  class.  The  second  class  of  tumor  suppressors, 
growth  regulators,  are  those  directly  involved  in  regulating  cell  proliferation  or  the  ability  of 
tumor  cells  to  survive  and  metastasize.  Oncogenes  such  as  myc  or  ras  are  dominant  and  act 
in  opposition  to  tumor  suppressors.  Many  of  the  genes  in  these  groups  directly  or 
indirectly  regulate  transcription  and  the  identification  of  target  genes  is  important  for 
elucidation  of  these  regulatory  pathways.  Many  oncogenes  and  tumor  suppressors  are 
involved  in  signaling  pathways  that  activate  the  expression  of  a  particular  subset  of  relevant 
genes.  Furthermore,  the  transcriptional  induction  of  genes  in  response  to  DNA  damage  is  a 
critical  component  of  the  cellular  response  to  DNA  damage  and  the  prevention  of 
mutagenesis.  The  identification  of  transcriptional  target  genes  is  critical  to  understanding 
how  these  regulatory  pathways  function.  In  this  grant  we  propose  the  identification  of 
target  genes  that  are  differentially  transcribed  in  response  to  oncogenic  stimuli  or  DNA 
damage  to  better  our  understanding  of  these  processes  and  to  provide  tools  for  future 
analysis. 

(6)  Body 

We  have  proposed  to  identify  genes  that  are  transcriptionally  activated  in  response 
to  DNA  damage  or  potentially  in  response  to  oncogenic  stimuli  such  as  activation  of  ras  or 
loss  of  Rb.  We  initially  proposed  to  develop  new  technology  to  identify  such  genes  by 
using  retroviral  insertions  into  genes  that  would  place  GFP  under  the  control  of 
endogenous  promoters.  We  have  run  into  problems  with  the  design  of  these  vectors  as 
proposed  in  Task  1  and  Task  2. 

As  an  initial  test  of  our  concept,  we  placed  GFP1  >2,6  under  the  control  of  the 
retroviral  LTR.  This  construct  provided  a  strong  promoter  and  an  optimal  expression 
context  for  GFP.  We  were  able  to  successfully  package  this  control  vector  and  showed  that 
it  could  infect  cells.  We  next  infected  cells  containing  an  IRES  sequence  and  GFP  and 
could  show  that  IRES  GFP  was  able  to  express  well  when  run  from  the  LTR  promoter  as 
had  been  previously  shown  for  other  promoters  carrying  IRES  and  reporters  such  as  lacZ 
and  luciferase3>4>5.  We  next  tried  to  simulate  the  expression  from  and  insertion  into  an 
open  reading  frame.  We  did  this  by  inserting  a  long  cDNA  between  the  LTR  and  IRES- 
GFP.  The  long  cDNA  we  chose  was  the  coding  sequence  for  the  BRCA1  gene.  When  the 
construct  LTR-BRCA1-IRES-GFP  was  infected  into  cells,  we  were  unable  to  see  GFP 
expression.  To  test  whether  this  was  due  to  an  inability  to  properly  package  because  the 
construct  was  too  big,  we  transfected  the  construct  and  one  containing  only  the  LTR-IRES- 
GFP  as  a  control.  Under  these  circumstances,  the  LTR-IRES-GFP  construct  expressed 
GFP  well  but  the  LTR-BRCA1-IRES-GFP  failed  to  express  significant  levels  of  GFP. 
This  has  caused  us  to  question  the  overall  design  of  the  vectors  and  for  this  reason  we  have 
had  a  delay  in  making  the  vectors  we  proposed  in  Tasks  1  and  2. 

We  are  still  making  the  original  vector  now  using  as  a  backbone  some  previously 
tested  vectors.  To  do  this  we  have  modeled  these  vectors  on  the  previously  tested  ROSA-p- 
gal  vectors  used  for  promoter  trapping  in  ES  cells.  However,  we  feel  that  it  might  be  better 
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to  have  protein  fusions  rather  than  transcriptional  fusions  because  our  experience  with  long 
transcripts  in  front  of  the  IRES-GFP  construct  is  likely  to  mean  that  we  will  miss  a 
significant  number  of  genes  because  of  long  transcripts.  We  have  now  redesigned  our 
fusion  vectors  more  along  the  lines  originally  outlined  in  Task  2  with  a  modification  shown 
below  in  Figure  1 .  The  problem  with  the  translational  fusion  vectors  as  originally  designed 
is  that  the  frequency  of  productive  fusions  would  be  quite  low.  However,  we  have  come 
across  a  modification  that  will  significantly  improve  the  frequency  of  isolating  in  frame 
retroviral  insertions7’8.  This  modification  allows  one  to  select  for  the  initial  retrovirus  to 
insert  into  a  transcription  unit.  This  is  achieved  by  having  the  selectible  marker,  neo  in  this 
case,  to  have  a  splice  donor  sequence  at  the  3'  end  instead  of  a  terminal  exon  with  a  poly-A 
addition  site.  In  order  for  the  retrovirus  to  express  neo,  it  must  integrate  into  a  transcription 
unit  in  the  proper  orientation  to  splice  into  the  3'  terminal  exon.  This  means  that  each  neo- 
resistant  colony  contains  an  insertion  into  a  transcription  unit  in  the  correct  orientation  such 
that  the  5'  end  of  the  retrovirus  is  in  a  position  to  act  as  a  splice  acceptor  site  for  making 
fusions  between  GFP  and  the  endogenous  gene.  This  should  increase  our  efficiency  of 
insertion  to  productive  GFP  fusion  by  about  20-fold. 


Figure  1 


Retroviral 

Vector 


Target 

Gene 


Splice  Terminal  Splice 

Acceptor  Exon  Donor 


T 


Retroviral  insertion  produces  2  poly 
adenylated  mRNAs,  a  transcript 
encoding  a  Gene-GFP  fusion  and  a 
transcript  containing  a  Neo-Gene  fusion. 
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Since  the  goal  of  this  project  was  to  identify  DNA  damage  inducible  genes  from 
eukaryotes,  we  thought  it  would  be  useful  to  also  know  the  spectrum  of  DNA  damage- 
inducible  genes  in  a  model  eukaryote  S.  cerevisiae.  We  have  established  a  collaboration 
with  Pat  Brown's  lab  to  establish  the  complete  list  of  DNA  damage  inducible  genes  from 
yeast  using  chip  hybridization  technology9.  We  have  performed  a  number  of  analyses  and 
have  identified  approximately  300  transcripts  that  are  significantly  increased  in  response  to 
DNA  damage.  Many  of  these  have  obvious  human  homologs  that  we  can  test  for  DNA 
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damage-inducibility.  Interestingly  we  found  that  they  fall  into  several  categories  based  on 
time  of  induction  during  a  time  course  and  into  groups  of  genes  based  on  function.  These 
genes  are  shown  grouped  into  functional  units  in  Tables  1  and  2  below.  In  addition  to 
genes  involved  in  DNA  replication,  nucleotide  synthesis  and  DNA  repair  we  were 
surprised  to  see  a  number  of  genes  involved  in  electron  transport,  carbohydrate 
metabolism,  Amino  acid  biosynthesis,  protein  folding  and  degradation.  This  significantly 
changes  the  way  we  now  look  at  the  likely  genes  we  will  find  in  our  screen  for  mammalian 
damage  inducible  genes. 

Table  1 


Types  of  yeast  genes  induced  by  DNA  damage  by  MMS  through  a  2  hr  time  course. 


Total  number  of  number  of  genes  number  of  genes  number  of  genes 

genes  induced  over  peaking  at  15-30  min  peaking  at  45-60  min  peaking  at  90-120 

2-fold _ min _ 

Amino  Acid  Biosynthesis 

22 _ 14 _ 4 _ 4 _ 

Protein  Degradation 

39 _ 6 _ 13 _ 20 _ 

Protein  Folding 

7 _ 6 _ 1 _ 0 _ 

Transport 

20 _ 7 _ 4 _ 9 _ 

Nucleotide  synthesis 

L5 _ 0 _ 1 _ 14 _ 

Stress-induced  genes 

20 _ 3 _ 2 _ 15 _ 

Electron  transport 

8 _ 0 _ 2 _ 6 _ 

Carbohydrate  metabolism 

45 _ 0 _ 7 _ 38 _ 

DNA  replication  and  repair 

13 _ 0 _ 6 _ 7 _ 

Wild  type  cells  were  treated  with  0.03%MMS  (80%  viability) 


Table  2.  Names  of  4  classes  of  yeast  genes  induced  by  DNA  damage 


DNA  replication  and/or  repair 

DUN  1  Checkpoint  protein  kinase 
RNR 1  *  Ribonucleotide  reductase 
RNR2*  Ribonucleotide  reductase 
RNR4*  Ribonucleotide  reductase 
TRX2* 

MAGI  Methyl  transferase 
RFA1  DNA  Replication 
RFA2  DNA  Replication 
POL30  DNA  Replication 
PHR1  Photo-repair 
RAD51  Recombination 
TRR1 

YHR029C  *  (similar  to  thymidylate 
synthase) 


Nucleotide  Biosynthesis 

THI1 

THI5 

THI12 

TRR1 

TRR2 

UGP1 

YNK1 

ADE17 

STF2 

YBR284W  (similar  to  AMP  deaminase) 
RNR1* 

RNR2* 

RNR4* 
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TRX2* 

YHR029C  *(similar  to  thymidylate 
synthase) 


Stress-induced  genes 

Genes  involved  in  protein 
degradation 

SOD1  Superoxide  dismutase 

RPN6 

HYR1 

RPT4 

Y  AP 1  T  ranscription  factor 

RPT6 

YHB1 

RPN7 

TIR1 

RPT2 

XBP1  Transcription  factor 

RPN12 

TIR2 

RPT1 

HSP42  Heat  shock  protein 

RPN3 

HSP104  Heat  shock  protein 

RPT3 

HSP82  Heat  shock  protein 

RPT5 

HSP12  Heat  shock  protein 

PRE5 

HSP26  Heat  shock  protein 

PRE9 

SSE2  Heat  shock  protein 

PRE10 

STI1  Heat  shock  protein 

PUP3 

SIS  1  Chaperone 

PRE3 

YGP1 

PRE2 

US  VI 

PRE8 

DDR48 

PUP1 

GRE2 

PRE1 

GRE3 

UMP1 

UBA1 

UBC5 

UBC8 

UBC13 

UBI4 

UFD1 

YPS6 

YPS6 

APE2 

PRB1 

PEP4 

PRD1 

LAP4 

CDC48 

UBP6 

YAP180 

AUT7 

APG1 

APG7 

8 


Another  goal  of  this  project  is  to  look  at  genes  inducible  by  oncogenic  stimuli.  We 
are  still  planning  to  do  this  using  the  retrovirus  vectors  we  are  constructing.  However,  we 
have  also  taken  an  alternative  approach  to  identify  such  genes.  We  have  set  up  a 
collaboration  with  Dr.  Ronald  Davis's  lab10  to  use  human  cDNA  chips  made  by 
Affymetrics  to  look  at  the  expression  pattern  of  genes  in  wild-type  diploid  fibroblasts 
compared  to  the  same  fibroblasts  that  express  the  Adenovirus  E7  gene  that  inactivated  the 
retinoblastoma  protein.  Inactivation  of  Rb  is  normally  controlled  by  cyclin-dependent 
kinases  and  cyclin-dependent  kinase  activation  is  a  downstream  event  in  many  oncogenic 
pathways.  We  have  compared  the  gene  expression  profiles  for  6000  human  cDNAs  from 
these  two  cell  lines  throughout  the  cell  cycle.  We  have  found  clusters  of  coordinately 
regulated  genes  through  the  cell  cycle  and  the  genes  maximally  expressed  in  G2/M  and  in  S 
phase  are  shown  in  Tables  3  and  4.  We  are  currently  analyzing  these  genes  for  differences 
between  the  E7  expressing  cell  and  normal  fibroblasts.  We  do  not  have  this  data  yet. 


Table  3  Human  genes  expressed  maximally  in  G2/M. 

Accession  Number  Gene  function _ 


X66945_at  3 

X54942_at  3 

X79201_at  3 

Z36714_at  3 

X87613_at  3 

Z49989_at  3 

L25876_at  3 

M62994_at  3 

U47635_at  3 

X14850_at  3 

U14518_at  3 

M25753_at  3 

U73379_at  3 

M96803_at  3 

M86699_at  3 

U37426_at  3 

D26361_at  3 

D31885_at  3 

D43948_at  3 

X51688_at  3 

D785 14_at  3 
U63743_at  3 

U28386_at  3 

U05340_at  3 

U01038_at  3 

M90657_at  3 

M91670_at  3 


H.sapiens  N-sam  mRNA  for  fibroblast  growth  factor  receptor 

H.  sapiens  ckshs2  mRNA  for  Cksl  protein  homologue 

H.sapiens  mRNA  for  SYT 

H.sapiens  mRNA  for  cyclin  F 

H.sapiens  mRNA  for  skeletal  muscle  abundant  protein 

H.sapiens  mRNA  for  smoothelin 

Homo  sapiens  protein  tyrosine  phosphatase  (CIP2)mRNA, 

Homo  sapiens  thyroid  autoantigen  (truncated  actin-binding  protein) 

Human  D13S824E  locus  mRNA,  complete  cds 

Human  H2A.X  mRNA  encoding  histone  H2A.X 

Human  centromere  protein-A  (CENP-A)  mRNA,  complete  cds 

Human  cyclin  B  mRNA,  3'  end 

Human  cyclin-selective  ubiquitin  carrier  protein  mRNA, 

Human  general  beta-spectrin  (SPTBN1)  mRNA,  complete  cds 

Human  kinase  (TTK)  mRNA,  complete  cds 

Human  kinesin-like  spindle  protein  HKSP  (HKSP)  mRNA, 

Human  mRNA  for  KIAA0042  gene,  complete  cds 
Human  mRNA  for  KIAA0069  gene,  partial  cds 
Human  mRNA  for  KIAA0097  gene,  complete  cds 
Human  mRNA  for  cyclin  A 

Human  mRNA  for  ubiquitin-conjugating  enzyme,  complete  cds 

Human  mitotic  centromere-associated  kinesin  mRNA,  complete  cds. 

Human  nuclear  localization  sequence  receptor  hSRPl  alpha  mRNA, 

Human  p55CDC  mRNA,  APC  activator 

Human  pLK  mRNA,  complete  cds 

Human  tumor  antigen  (L6)  mRNA,  complete  cds 

Human  ubiquitin  carrier  protein  (E2-EPF)  mRNA,  complete  cds 
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Table  4.  Human  genes  maximally  expressed  during  S  phase. 

Accession  Number  Gene  function 


X68277_at  7 

X66087_at  7 

X89750_at  7 

Z46967_at  7 

X62535_at  7 

Z24725_at  7 

M77140_at  7 

Z24727_at  7 

U62015_at  7 

Z46629_at  7 

J02854_at  7 

U73960_at  7 

U44975_at  7 

U27655_at  7 

U27768_at  7 

U59752_at  7 

U28049_at  7 

X16416_at  7 

M92934_at  7 

U04636_at  7 

U18300_at  7 

U81607_at  7 

X52142_at  7 

D90209_at  7 

X52599_at  7 

X58377_at  7 

L08246_at  7 

U08021_at  7 

U40490_at  7 

M12174_at  7 

M6283  l_at  7 
J03764_at  7 


H.sapiens  CL  100  mRNA  for  protein  tyrosine  phosphatase 
H.  sapiens  a-myb  mRNA 
H.sapiens  mRNA  for  TGIF  protein 
H.sapiens  mRNA  for  calicin  (partial) 

H.sapiens  mRNA  for  diacylglycerol  kinase 

H.sapiens  mitogen  inducible  gene  mig-2,  complete  CDS 

H.sapiens  pro-galanin  mRNA,  3'  end 

H.sapiens  tropomyosin  isoform  mRNA,  complete  CDS 

Homo  sapiens  Cyr61  mRNA,  complete  cds 

Homo  sapiens  SOX9  mRNA 

Human  20-kDa  myosin  light  chain  (MLC-2)  mRNA,  complete  cds 

Human  ADP-ribosylation  factor-like  protein  4  mRNA,  complete  cds 

Human  DNA-binding  protein  CPBP  (CPBP)  mRNA,  partial  cds 

Human  RGP3  mRNA,  complete  cds 

Human  RGP4  mRNA,  complete  cds 

Human  Sec7p-like  protein  mRNA,  partial  cds 

Human  TBX2  (TXB2)  mRNA,  complete  cds 

Human  c-abl  mRNA  encoding  pi 50  protein 

Human  connective  tissue  growth  factor,  complete  cds 

Human  cyclooxygenase-2  (hCox-2)  gene,  complete  cds 

Human  damage-specific  DNA  binding  protein  p48  subunit  (DDB2) 

Human  gravin  mRNA,  complete  cds 

Human  mRNA  for  CTP  synthetase  (EC  6.3.4.2) 

Human  mRNA  for  DNA  binding  protein  TAXREB67 
Human  mRNA  for  beta  nerve  growth  factor 
Human  mRNa  for  adipogenesis  inhibitory  factor 
Human  myeloid  cell  differentiation  protein  (MCL1)  mRNA 
Human  nicotinamide  N-methyl transferase  (NNMT)  mRNA, 

Human  nicotinamide  nucleotide  transhydrogenase  mRNA, 

Human  ras-related  rho  mRNA  (clone  6),  partial  cds 
Human  transcription  factor  ETR101  mRNA,  complete  cds 
Human,  plasminogen  activator  inhibitor- 1  gene,  exons  2  to  9 


(7)  Key  Research  Accomplishments 

A)  Determination  that  IRES-GFP  can  be  expressed  from  a  retroviral  vector  in  mouse  cells 

B)  Determination  that  long  cDNAs  upstream  of  IRES-GFP  constructs  significantly  reduces 
the  amount  of  expression. 

C)  Identification  of  all  known  DNA  damage  inducible  genes  in  the  yeast  S.  cerevisiae. 


(8)  Reportable  Outcomes. 

None  yet. 
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(9)  Conclusions 

So  far  our  work  has  established  that  we  can  work  with  GFP  expressing 
retroviruses  and  that  the  expression  of  an  IRES-GFP  fusion  depends  upon  its  proximity  to 
the  transcriptional  start.  We  have  decided  to  focus  more  on  the  translational  fusion  versions 
of  the  retrovirus  system  to  get  around  these  problems.  We  have  also  identified  a  number  of 
DNA  damage-inducible  genes  in  a  lower  eukaryote,  S.  cerevisiae  and  these  will  serve  as  a 
background  upon  which  to  compare  the  human  DNA  damage  inducible  genes  we  obtain 
from  our  screen  using  the  retroviruses  proposed.  The  control  experiment  for  looking  at 
human  cDNAs  that  differ  between  wild-type  fibroblasts  and  fibroblasts  expressing  the  E7 
protein  have  been  working  and  will  help  us  identify  genes  whose  expression  is  activated  or 
repressed  by  Rb, 
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